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We Claim: 



1 . A method for inferring a gene network, comprising 

(a) providing an inferential model of possible gene networks of an organism 
including defining a search space; 
10 (b) selecting a biologically relevant subspace of said search space; and 

(c) calculating an optimal solution in said selected subspace by repeatedly 
applying an algorithm that computes small gene networks optimally. 

2. The method of claim 1, wherein said inferential model is a Bayesian network 
1 5 estimation model. 



3. The method of claim 1 , wherein said biologically relevaiit subspace includes genes 
relating to a metabolic pathway of said organism. 

20 4. The method of claim 1 , wherein said algorithm comprises the steps: 

(a) compute F(g,0) = s{g,<P) for all g e G; 

(b) for all A^G, A^<p and all g e G compute F(g, ^)as 
min {s(g^),mm^^AF(gyA- {a})}; 

(c) set Mi0)=(p; 

25 (d) for all A^G, A^ (p,dofhc following steps: 

(i) compute g* =arg min^A (F(g,A-{g})-^Q''^^\MiA-{g}))); and 

(ii) forall l</<|^LsetM(.4)(0=M(^-{g*})(0,and A/(.4)(|^^ =^ 

*; and 

(e) retum Q^(M(G)). 
30 5. The method of claim 4, wherein said algorithm is modified according to the steps of: 
(a) in the computation of F in Step 1 and Step 2, compute only F(g, A) for all 
g 6 Si and all >4cCg; and 
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5 (b). replace the term F{g, A - {g}) in Step 4a by i^(g, (Q = Sd U (Qfl ^))- 

6. The method of claim 1 , wherein an optimal network A^has a definition: score(N) = 

10 7. The method of claim 1 , wherein said algorithm comprises the steps: 

(a) cluster genes in G such that no cluster is larger than c genes; 

(b) sort the clusters by decreasing size: Ci, . . . , C„; 

(c) for each i e {I, . . , , n} and for each g e Q, select up to m candidate 
parents fi*om Ci U ... U 

1 5 (d) compute an optimal gene network model using Theorem 1 .2. 

8. The method of claim 1 , wherein said algorithm comprises the steps: 

(a) group genes in G in groups Q with | C, ] ^ c and sort them according to 
biological knowledge: Ci C„; 

20 (b) for each z 6 { 1 , . . . ,«} and for each gene g e C, select up to m candidate 

parents fi-om Ci U ... U and 

(c) compute an optimal gene network model using Theorem 2. 

9. The method of claim 1 , wherein said algorithm comprises the steps: 
25 (a) compute F(g, (p) = s{g, (p) for all geG\ 

(b) for all A^Gy A^ <p and all g e G compute F(g, A) as 
Tr)m{s(g,A),mm^^AF(g^-{a})}; 

(cc) setA/(0)=^, 

(d) for all Aq,G,A^ (p , do the following two steps: 

30 (i) computeg* =arg min^A {F(gA-{g})+&^'''^\M{A-{g})))\ and 

(ii) Forall 1 </ < MLsetM(^)(z)=M(^-{g*})(0,^^ =g 

*; and 

(e) return Q%M{G)). 



10. 



The method of claim 1, wherein said algorithm comprises the steps: 
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5 (a) set F"(g,(pA) = 0, Sr(g,(p,l) = s(g,<p) for all g e G; 

(b) for all ge G, all A^G, A ^Nand Mn^m do (he following two steps: 

(i) selects* from {B^\B=AvB=F^(g,A-{h} j?), he A, p^m} - 
{F"(g,A,p)\p<n} such that s(gyB*) is minimized; and 

(ii) set ]^(g^,n)=B\ 5"(g^,/i)=5fe3*); 
10 (c) setA/"(<?>,l)=^and£)'"(0,l)=^, 

(d) for all As^Gy (p, and all n<.m do the following three steps: 

(i) choose a triple (g,p,g) eA x IN^m x IN^m such that 
scoreiQ'-^^\Ar(A-{g} ,p)J^"*{A-{g} ,g) is minimized and (g^,^) induces a 
network different from Q^(M"(A,r)jy"(A,r)) for r<M; 
15 (ii) setA/"(yl,w)(0=A/^(^-{g}^)(0 for/< |yl|,andAr(^n)(|^|)=g; 

and 

(iii) let V denote Z^^'C^- {g} ,/?). Set w e /A^^' as wrvi for all / 
< 1^ I and 1^ I = ^ and set lT{A,n) = w; and 

(e) return Q%M"{G,i)jr{G,i)) for all Urn. 

20 

11. The method of any of claims 1-10, wherein reliability of an enumerated gene 
network, comprising the steps: 

(a) enimierate the most likely gene network models A^, 1 ^i^n; 

(b) for every g, h 60G, count the occurences of the edge (g,h) in the networks 

25 Nil 

(c) select all edges (g,h) with at least c occurrences; 

(d) for all subsets M of the set of selected edges with \M\ = k, count the 
networks including all edges in MI; and 

(e) retum all motives Af with at least c occurences. 

30 

12. The method of any of claims 1-11, further comprising calculating a scoring function 
selected from the group consisting of BRNC score, BDe score and MDL score. 

35 13. A method for determining a gene network as substantially described herein. 



14. A storage medium containing results obtained using the method of any of claims 1-11. 
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5 15. A storage medium containing results obtained using a method as substantially 
described herein. 

16. A system for determining gene network relationships, comprising: 

an input device for providing quantitative expression data for genes of an organism; 
10 a storage device adapted to receive quantitative expression data for genes of said 

organism; 

a processor adapted to carryout a Bayesian network analysis of network relationships 
between said genes, thereby producing a data set reflecting said network relationships; and 
an output device for displaying said data set of said network relationships. 

15 

17. A system for determining gene network relationships as substantially described 
herein. 



